Cross-language Text Retrieval with Three Languages Cross-language Text Retrieval with Three Languages
نویسندگان
چکیده
In cross-language text retrieval, query text objects in one language are matched against a collection of text objects in another. Previous work showed that a two-language cross-language text-retrieval system can be created completely automatically by a method called CL-LSI trained on an aligned corpus of text objects. In this paper, we look at the challenge of creating a three-language cross-language text-retrieval system. We nd that CL-LSI extends easily to the three-language case when each training text object is available in all three languages (3-way aligned corpora). However, when each training text object is supplied in only two of the three languages (two-of-three pairwise aligned corpora), the natural extension of CL-LSI behaves very badly. We illustrate these observations on a large English-French-Spanish collection and introduce analysis tools that may be useful for future explorations.
منابع مشابه
Using Text Surrounding Method to Enhance Retrieval of Online Images by Google Search Engine
Purpose: the current research aimed to compare the effectiveness of various tags and codes for retrieving images from the Google. Design/methodology: selected images with different characteristics in a registered domain were carefully studied. The exception was that special conceptual features have been apportioned for each group of images separately. In this regard, each group image surr...
متن کاملKeizai: An Interactive Cross-Language Text Retrieval System
Can we expect people to be able to get information from texts in languages they cannot read? In this paper we review two relevant lines of research bearing on this question and will show how our results are being used in the design of a new Web interface for cross-language text retrieval. One line of research, “Interactive IR”, is concerned with the user interface issues for information retriev...
متن کاملThe Evaluation of Systems for Cross-language Information Retrieval
We describe the creation of an infrastructure for the testing of cross-language text retrieval systems within the context of the Text REtrieval Conferences (TREC) organised by the US National Institute of Standards and Technology (NIST). The approach adopted and the issues that had to be taken into consideration when building a multilingual test suite and developing appropriate evaluation proce...
متن کاملArabic/English Cross Language Information Retrieval Using a Bilingual Dictionary
With the increase of multilingual information available online and the increase of non-native English speaker (Arabic users) browsing the Internet, it has become more important to have information retrieval systems that can carry the retrieval process across language boundaries that is, cross language information retrieval CLIR systems. The CLIR system responds to the user query in a comprehens...
متن کاملAlternative Approaches for Cross-Language Text Retrieval
The explosive growth of the Internet and other sources of networked information have made automatic me diation of access to networked information sources an increasingly important problem Much of this informa tion is expressed as electronic text and it is becoming practical to automatically convert some printed docu ments and recorded speech to electronic text as well Thus automated systems cap...
متن کامل